Implementing Linguistic Query Languages Using LoToS

نویسندگان

  • Lukas C. Faulstich
  • Ulf Leser
چکیده

A linguistic database is a collection of texts where sentences and words are annotated with linguistic information, such as part of speech, morphology, and syntactic sentence structure. While early linguistic databases focused on word annotations, and later also on parse-trees of sentences (so-called treebanks), the recent years have seen a growing interest in richly annotated corpora of historic texts that include not only syntactic annotations but further complex annotations, such as alignments between related text layers. This raises the issue of efficiently querying such complex structured linguistic databases. We present a generic approach for defining domain-specific query languages that we use in developing a query language for richly annotated historic corpora. In our approach, a query language is defined as a set of predicates. A query in form of a logic rule is translated by our LoToS query compiler into a single, possibly deeply nested SQL query. In contrast to previous approaches, the annotation structures that can be queried need not be trees but can also form DAGs, or, for a restricted class of recursive queries, arbitrary graphs. To this end, LoToS offers an operator for computing transitive closures using the recursive capabilities of modern database systems. We believe that this is the first approach to use modern SQL capabilities for evaluating recursive predicates in logic-based query languages. Chapter

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Instantiation and Implementation of a Corpus Query Lingua Franca Master Thesis by

The present thesis introduces KoralQuery, a protocol for the generic representation of queries to linguistic corpora. KoralQuery defines a set of types and operations which serve as abstract representations of linguistic entities and configurations. By combining these types and operations in a nested structure, the protocol may express linguistic structures of arbitrary complexity. It achieves ...

متن کامل

Fuzzy linguistic logic programming and its applications

The paper introduces fuzzy linguistic logic programming, which is a combination of fuzzy logic programming, introduced by P. Vojtáš, and hedge algebras in order to facilitate the representation and reasoning on human knowledge expressed in natural languages. In fuzzy linguistic logic programming, truth values are linguistic ones, e.g., VeryTrue, VeryProbablyTrue, and LittleFalse, taken from a h...

متن کامل

Developing Coherent Multiagent Systems using JAFMAS

The Java-based Agent Framework for Multi-Agent Systems (JAFMAS) provides a generic methodology for developing speech-act based multiagent systems (MAS) along with a set of classes to support implementing these agents in Java. The methodology follows five stages: (i) agent identification, (ii) definition of each agent’s conversations, (iii) determining the rules governing each agent’s conversati...

متن کامل

A Wish List for the Behaviour Part of LOTOSVersion 2 . 0 Hubert GARAVEL

This document supersedes a previous Afnor contribution dated from June 1994 and entitled \Six improvements to the process part of Lotos". We propose nineteen changes, which aaect the syntax, static semantics and/or dynamic semantics of the behaviour part of Lotos. These changes aim at solving several problems found in Lotos and making the behaviour part of E-Lotos expressive, simple, symmetric ...

متن کامل

Querying Linguistic Trees

Large databases of linguistic annotations are used for testing linguistic hypotheses and for training language processing models. These linguistic annotations are often syntactic or prosodic in nature, and have a hierarchical structure. Query languages are used to select particular structures of interest, or to project out large slices of a corpus for external analysis. Existing languages suffe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005